Search for: All records

Creators/Authors contains: "Tan, Sin Yong"

  1. We propose a novel policy gradient method for multi-agent reinforcement learning, which leverages two different variance-reduction techniques and does not require large batches over iterations. Specifically, we propose momentum-based decentralized policy gradient tracking (MDPGT), where a new momentum-based variance-reduction technique is used to approximate the local policy gradient surrogate with importance sampling, and an intermediate parameter is adopted to track two consecutive policy gradient surrogates. MDPGT provably achieves the best available sample complexity of O(N⁻¹ε⁻³) for converging to an ε-stationary point of the global average of N local performance functions (possibly nonconcave). This outperforms the state-of-the-art sample complexity in decentralized model-free reinforcement learning, and when initialized with a single trajectory, the sample complexity matches that obtained by existing decentralized policy gradient methods. We further validate the theoretical claim for the Gaussian policy function. When the required error tolerance ε is small enough, MDPGT yields a linear speedup, which has previously been established in decentralized stochastic optimization but not for reinforcement learning. Lastly, we provide empirical results on a multi-agent reinforcement learning benchmark environment to support our theoretical findings. (A minimal sketch of the update structure appears after this list.)
  2. Occupancy detection systems are commonly equipped with high-quality cameras and a processor with high computational power to run detection algorithms. This paper presents a human occupancy detection system that uses battery-free cameras and a deep learning model implemented on a low-cost hub to detect human presence. Our low-resolution camera harvests energy from ambient light and transmits data to the hub using backscatter communication. We implement the state-of-the-art YOLOv5 object-detection network, which offers high detection accuracy and fast inference speed, on a Raspberry Pi 4 Model B. We achieve an inference speed of ~100 ms per image and an overall detection accuracy of >90% with only 2 GB of CPU RAM on the Raspberry Pi. In the experimental results, we also demonstrate that the detection is robust to noise, illuminance, occlusion, and angle of depression. (A minimal inference-loop sketch appears after this list.)
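The first item combines a momentum-based variance-reduced local gradient surrogate (with an importance-sampling correction) and a gradient-tracking consensus step over a communication graph. The following is a minimal toy sketch of that structure in NumPy, not the paper's implementation: the synthetic objectives, the local_grad and importance_weight helpers, the ring mixing matrix W, and all hyperparameter values are illustrative assumptions (the importance-sampling ratio is fixed to 1 here, whereas MDPGT computes it from consecutive policies).

```python
import numpy as np

# Toy setup: N agents, each with a d-dimensional local policy parameter vector.
rng = np.random.default_rng(0)
N, d, T = 4, 8, 200
targets = rng.normal(size=(N, d))          # per-agent optima of the toy objectives

def local_grad(i, theta):
    """Stochastic gradient of agent i's toy objective (stands in for a sampled policy gradient)."""
    return (targets[i] - theta) + 0.1 * rng.normal(size=d)

def importance_weight(theta_new, theta_old):
    """Placeholder importance-sampling ratio between consecutive policies (set to 1 in this toy sketch)."""
    return 1.0

# Doubly stochastic mixing matrix for a ring topology (gossip weights), an assumption for illustration.
W = np.zeros((N, N))
for i in range(N):
    W[i, i] = 0.5
    W[i, (i - 1) % N] = 0.25
    W[i, (i + 1) % N] = 0.25

beta, lr = 0.1, 0.05                       # momentum coefficient and step size (illustrative)
theta = np.zeros((N, d))                   # local policy parameters
u = np.array([local_grad(i, theta[i]) for i in range(N)])   # momentum-based gradient surrogates
v = u.copy()                               # gradient-tracking variables

for t in range(T):
    theta_new = W @ (theta + lr * v)       # consensus mixing plus ascent along the tracked gradient
    u_new = np.empty_like(u)
    for i in range(N):
        g_new = local_grad(i, theta_new[i])
        g_old = local_grad(i, theta[i])
        w_is = importance_weight(theta_new[i], theta[i])
        # Momentum-based variance reduction on the local surrogate.
        u_new[i] = beta * g_new + (1 - beta) * (u[i] + g_new - w_is * g_old)
    # Gradient tracking: mix neighbours' trackers and add the change in local surrogates.
    v = W @ (v + u_new - u)
    theta, u = theta_new, u_new
```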
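The second item runs YOLOv5 on a Raspberry Pi hub to decide whether a frame from the battery-free camera contains a person. Below is a minimal sketch using the public ultralytics/yolov5 torch.hub interface; the model variant (yolov5s), the confidence threshold, and the frame filename are assumptions for illustration, not details taken from the paper.

```python
import torch

# Load a YOLOv5 model via the public torch.hub interface (model variant is an assumption).
model = torch.hub.load('ultralytics/yolov5', 'yolov5s', pretrained=True)
model.conf = 0.5                     # detection confidence threshold (illustrative)

# Placeholder for a low-resolution frame received from the battery-free camera,
# e.g. decoded from the backscatter link; any image path or numpy array works here.
frame = 'frame_from_backscatter_camera.jpg'

results = model(frame)               # single forward pass on the hub
detections = results.pandas().xyxy[0]
people = detections[detections['name'] == 'person']
occupied = len(people) > 0
print(f'occupied={occupied}, persons detected={len(people)}')
```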